Available online at www.sciencedirect.com 


science (joinect® 
ACADEMIC 


PRESS Virology 315 (2003) 174-183 


VIROLOGY 


www.elsevier.com/locate/yviro 


Common RNA replication signals exist among group 2 coronaviruses: 
evidence for in vivo recombination between animal 
and human coronavius molecules 


Hung-Yi Wu,*” James S. Guy, Dongwan Yoo,“ Reinhard Vlasak,° 
Ena Urbach,”' and David A. Brian®”* 


* Department of Microbiology, College of Veterinary Medicine, University of Tennessee, Knoxville, TN 37996-0845, USA 
> Department of Pathobiology, College of Veterinary Medicine, University of Tennessee, Knoxville, TN 37996-0845, USA 
© Department of Farm Animal Health and Resource Management, College of Veterinary Medicine, 
North Carolina State University, Raleigh, NC 27606, USA 
“ Department of Pathobiology, Ontario Veterinary College, University of Guelph, Guelph, Ontario NIG 2W1, Canada 
° Austrian Academy of Sciences, Institute of Molecular Biology, Department of Biochemistry, Billrothstrasse 11, A-5020 Salzburg, Austria 


Received 4 April 2003; returned to author for revision 16 April 2003; accepted 16 June 2003 


Abstract 


5’ and 3’ UTR sequences on the coronavirus genome are known to carry cis-acting elements for DI RNA replication and presumably 
also virus genome replication. 5’ UTR-adjacent coding sequences are also thought to harbor cis-acting elements. Here we have determined 
the 5' UTR and adjacent 289-nt sequences, and 3’ UTR sequences, for six group 2 coronaviruses and have compared them to each other 
and to three previously reported group 2 members. Extensive regions of highly similar UTR sequences were found but small regions of 
divergence were also found indicating group 2 coronaviruses could be subdivided into those that are bovine coronavirus (BCoV)-like 
(BCoV, human respiratory coronavirus-OC43, human enteric coronavirus, porcine hemagglutinating encephalomyelitis virus, and equine 
coronavirus) and those that are murine hepatitis virus (MHV)-like (A59, 2, and JHM strains of MHV, puffinosis virus, and rat 
sialodacryoadenitis virus). The 3’ UTRs of BCoV and MHV have been previously shown to be interchangeable. Here, a reporter-containing 
BCoV DI RNA was shown to be replicated by all five BCoV-like helper viruses and by MHV-H2 (a human cell-adapted MHV strain), a 
representative of the MHV-like subgroup, demonstrating group 2 common 5’ and 3’ replication signaling elements. BCoV DI RNA, 
furthermore, acquired the leader of HCoV-OC43 by leader switching, demonstrating for the first time in vivo recombination between animal 
and human coronavirus molecules. These results indicate that common replication signaling elements exist among group 2 coronaviruses 
despite a two-cluster pattern within the group and imply there could exist a high potential for recombination among group members. 
© 2003 Elsevier Inc. All rights reserved. 
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Introduction serologic relatedness but more recently on the basis of 


genome sequence similarities (Cavanagh et al., 1997; Lai 


The coronavirus genus in the family Coronaviridae has 
been divided into three groups, initially on the basis of 
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and Cavanagh, 1997; Siddell, 1995). The recently described 
human severe acute respiratory syndrome (SARS) corona- 
virus appears not to fall precisely into any of these groups 
(Marra et al., 2003; Rota et al., 2003). Members of group 1 
share identity with the porcine transmissible gastroenteritis 
virus (TGEV) and [officially, as of 2000 (Enjuanes et al., 
2000)] include TGEV [and strain porcine respiratory coro- 
navirus (PRCoV)], canine coronavirus (CCoV), feline coro- 
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navirus (FECoV) [and strain feline infectious peritonitis 
virus (FIPV)], human _ respiratory coronavirus-229E 
(HCoV-229E), and porcine epidemic diarrhea virus 
(PEDV). Members of group 2 share identity with the mouse 
hepatitis coronavirus (MHV) and include bovine coronavi- 
rus (BCoV), human respiratory coronavirus-OC43 (HCoV- 
OC43), porcine hemagglutinating encephalomyelitis coro- 
navirus (HEV), rat coronavirus (RtCoV) [and strain 
sialodacryoadenitis coronavirus (SDAV)] (Enjuanes et al., 
2000). Members of group 3 share identity with the avian 
infectious bronchitis virus (IBV) and include IBV and tur- 
key enteric coronavirus (TCoV) (Enjuanes et al., 2000; Guy, 
2000). Coronaviruses likely to become officially classified 
as group 2 members on the basis of antigenic relatedness 
and genome sequence similarities include human enteric 
coronavirus-4408 (HECoV-4408) (Zhang et al. 1994), 
puffinosis virus (PV) (Klausegger et al., 1999), and equine 
enteric coronavirus (ECoV) (Guy et al., 2000). 

Recombination among coronaviruses is an attribute of 
the genus and is thought to contribute to the emergence of 
new pathotypes (Lai, 1992; Lai and Cavanagh, 1997). Cu- 
riously, recombination so far has been noted only between 
species of the same group, for example, between the feline 
and canine enteric coronaviruses (Herrewegh et al., 1998), 
between strains of MHV (Keck et al., 1988), and between 
strains of IBV (Jia et al., 1995; Kottier et al., 1995), al- 
though in principle intergroup recombination is possible 
based on reverse genetics experimentation in which an 
MHV chimera was produced with the ectodomain portion of 
the FeCoV spike protein gene (Kuo et al., 2000). Inasmuch 
as natural recombination events among coronaviruses prob- 
ably result from a polymerase jumping mechanism during 
coinfection (Brian and Spaan, 1997; Lai, 1992; Lai and 
Cavanagh, 1997), signals read by the RNA polymerase 
complex to initiate and carry out genome replication are 
likely to contribute to the recombination process. It would 
therefore be expected that viruses sharing signals for RNA 
polymerase recognition would be more likely to undergo 
recombination than those that do not. Characterization of 
common signals for RNA synthesis, therefore, may aid in 
predicting which viruses are likely to recombine during a 
natural coinfection. 

To date, common 3’-proximal replication signaling ele- 
ments have been noted between MHV and BCoV, both 
group 2 members for which DI RNAs have been studied. In 
an MHV-A59-derived DI RNA, which comprised only the 
virus genomic termini, and in the MHV-A59 genome, it has 
been shown that the entire 3’ UTR of BCoV is able to 
replace the equivalent region without loss of DI RNA rep- 
licability or virus viability (Hsue and Masters, 1997; Hsue et 
al., 2000). Likewise, in a BCoV DI RNA that comprised 
only the virus genomic termini (Fig. 1; Chang et al., 1994) 
it has been shown that the MHV-A59 3’ UTR was able to 
replace the BCoV 3’ UTR with no detectable loss of repli- 
cating ability (S. Ku, G.D. Williams, and D.A. Brian, un- 
published data). The partially characterized 3’ UTR-map- 
ping cis-acting elements in BCoV and MHV include a 
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Fig. 1. Structure of the bovine coronavirus DI RNA and pDrep1 as they 
relate to the parent bovine coronavirus genome. Note that 289 nts of ORF 
la and the entire N ORF are part of the DI RNA. The reporter is a 30-nt 
sequence derived from the TGEV N gene that was cloned in-frame into the 
Bglll site of the BCoV N gene (Chang et al., 1994). 


5'-proximal bulged stem-loop (Hsue and Masters, 1997; 
Hsue et al., 2000), a 5’-proximal hairpin-type pseudoknot 
(Williams et al., 1999), a centrally located stem-loop (Liu et 
al., 2001), and a 3’-terminal poly(A) tail (Spagnolo and 
Hogue, 2000). What roles the higher order structures play 
are not clear since the 3’-terminal 55-nts and poly(A) tail 
are thought to be the minimal sequence requirements for 
minus-strand synthesis (Lin et al., 1994). Common 5’-prox- 
imal replication signals between MHV and BCoV have not 
yet been described. For both MHV and BCoV DI RNAs it 
has been shown that the 5’-terminal sequence is required for 
DI RNA replication (Chang et al., 1994; Kim et al., 1993) 
and in BCoV DI RNA at least two stem-loops identified as 
stem-loops III and IV function as higher order cis-acting 
signaling elements (Raman et al., 2003; S. Raman and D. 
Brian, unpublished data). A higher order cis-acting structure 
mapping within the first 290 nts of ORFI has also been 
found (C. Brown, K. Nixon, S. Senanayake, and D. Brian, 
unpublished data). Might these elements be common be- 
tween MHV and BCoV as well? To what extent might 5’- 
and 3’-proximal replication signals be shared among other 
group 2 members? 

Here we examine the 5’ UTR and adjacent 289-nt se- 
quences of four established and two new putative group 2 
coronavirus members, the 3’ UTR sequences of four estab- 
lished and one new putative member, and compare them 
with previously reported sequences of three other group 2 
coronaviruses. Our results suggest that the 5’ and 3’ UTR, 
and 5' UTR-adjacent sequence patterns, may be subgrouped 
into those that are BCoV-like and those that are MHV-like. 
This bifurcated pattern is consistent with a proposed sub- 
grouping of group 2 coronaviruses on the basis of activities 
of the hemagglutinin-esterase proteins (Wurzer et al., 2002). 
The diverged UTR patterns, therefore, raised the question as 
to whether the two subgroups of group 2 coronaviruses 
might differ with respect to their RNA replication signals as 
measured by replication assays. To begin to answer this, we 
tested replication in trans of a BCoV DI RNA with various 
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group 2 helper viruses and learned that replication was 
supported by all five viruses within the BCoV-like sub- 
group, and by MHV-H2, the only virus tested within the 
MHV-like subgroup. It was also found that the BCoV DI 
RNA acquired the leader sequence of HCoV-OC43 helper 
virus, thereby demonstrating for the first time an in vivo (in 
this case, in cell-culture) recombination event between mol- 
ecules of an animal and human coronavirus. These results 
indicate that the signals for recognition and engagement by 
the RNA replication machinery are probably common 
among group 2 coronaviruses, regardless of subgroup clas- 
sification, and indicate a potential for recombination among 
members of the group should mixed infections occur. 


Results 


Nucleotide sequence analyses of the 5' UTR and adjacent 
289 nts, and 3' UTR, of the group 2 coronaviruses reveal 
a two cluster pattern 


Structural elements acting as cis-acting replication sig- 
nals in genomic and DI RNAs have been found in the 5’ 
UTR and adjacent 289 nts, and in the 3’ UTR, of MHV and 
BCoV (reviewed under Introduction above). To examine 
potentially conserved features in these regions among the 
group 2 coronaviruses, the 5’ UTR sequences, 5’ UTR- 
adjacent 289 nts, and 3’ UTR sequences of HCoV-OC43, 
HECoV-4408, HEV-TN11, ECoV-NC99, and SDAV were 
determined and compared with the published sequences for 
BCoV-Mebus, MHV-A59, MHV-2, and MHV-JHM (see 
GenBank Accession Nos. described below and Figs. 2 and 
3). For PV, the 3’ UTR was also compared (Fig. 3). Upon 
alignment it became apparent by inspection that there were 
several regions, up to 18 nts in length in the 5’ UTR and 
adjacent 289 nts and up to 29 nts in the 3’ UTR, that were 
identical among all the viruses examined (identified by 
shading in Figs. 2 and 3). It was likewise apparent that there 
were several smaller regions in which sequences diverged 
considerably, yielding patterns that would suggest the vi- 
ruses could be grouped into those that are either BCoV-like 
or MHV-like. The existence of a two-cluster pattern was 
confirmed in separate phylogenetic analyses of the 5’ and 3’ 
UTRs and of the amino acid sequences in the partial ORF 
adjacent to the 5'UTR (Figs. 4A-C). 


Helper virus from either cluster can support the 
replication of BCoV DI RNA in HRT cells 


It has been previously shown that the 3’ UTRs of MHV 
and BCoV are functionally interchangeable in the context of 
the MHV genome (Hsue and Masters, 1997; Hsue et al., 
2000) and BCoV DI RNA (S. Ku, G.D. Williams, and D.A. 
Brian, unpublished data), indicating the presence of com- 
mon 3’-proximal replication signals. Common 5’ proximal 
RNA replication signals have not been similarly sought. To 
test for the presence of common replication signals, the 


accumulation of reporter-containing BCoV DI RNA (tran- 
scripts of pDrep1; Fig. 1) was tested with five of the BCoV- 
like and one of the MHV-like group 2 helper viruses de- 
scribed (Figs. SA—-C). For this, HRT-18 cells were mock 
infected or infected with BCoV-Mebus, HCoV-OC43, 
ECoV-NC99, HECoV-4408, HEV-TN, or MHV-H2, trans- 
fected 1 h later with pDrep! RNA, and total RNA from cells 
was extracted at 1, 24, and 48 h posttransfection, and at 24h 
after infection with passage 1 virus (and in some cases 
passage 2 virus) for measurement of DI RNA accumulation. 
Accumulation was measured by Northern blotting using a 
pDrep1 reporter-specific radiolabeled probe. In the absence 
of helper virus, transfected DI RNA has a measured half-life 
of less than 4 h and no accumulation is observed at 24 h 
posttransfection (Chang et al., 1994, 1996; Senanayake and 
Brian, 1999; data not shown). The results for the BCoV-like 
viruses shown in Figs. 5A and C illustrate a net accumula- 
tion of progeny DI RNA molecules by 24 and 48 h post- 
transfection, and by 24 h after virus passaging (Fig. 5A, 
lanes 1—4), establishing that common replication signals are 
recognized by the BCoV-like coronaviruses tested. It should 
be noted that whereas helper virus replication rates for the 
viruses other than BCoV were less than that for BCoV at 24 
hpi through VP1 [ranging from 1 to 50%, as measured by 
reprobing the stripped blot for viral mRNAs with a univer- 
sally sensitive 3’ UTR-detecting probe (3’UTRPSEU, Table 
1)] (Figs. 5A, B, and D), BCoV DI RNA replication mea- 
sured at the same time points was relatively higher (ranging 
from 5 to 100%) (Figs. SA—C). While the quantitative mea- 
surements made here are not precise enough to establish the 
relative efficiencies of DI RNA replication by the various 
helper viruses, or the relative inhibitory effects on virus 
replication by the DI RNA, they do establish a helper virus 
function for DI RNA replication in each case. 

In previous studies on the regulatory effects of BCoV 
genomic 5’ UTR sequence on translation (Senanayake and 
Brian, 1999) it was learned that whereas BCoV replicated in 
murine OST-7 cells [a T7 RNA polymerase-expressing 
mouse L cell-derived line (Elroy-Stein and Moss, 1990)] 
nearly as well as in human rectal tumor cells [HRT-18 cells 
used for BCoV propagation (Chang et al., 1994)] the rep- 
lication of transfected pDrep1 transcripts could not be ob- 
served at all with BCoV helper virus in the murine cells. An 
attempt was therefore made to test for the helper effect of 
MHV in HRT cells. For this we used the MHV-H2 strain 
that had been selected as a host range mutant of a MHV- 
A59/JHM recombinant on persistently infected BHK cells 
now able to grow on human cells (Baric et al., 1997). In 
MHV-H2-infected HRT cells, pDrep1 RNA accumulated as 
well as in HCoV-OC43 and ECoV-NC99 helper virus in- 
fected cells through 48 h posttransfection (20-40% of 
BCoV helper virus levels vs 15—40%, respectively; Fig. 5B, 
lanes 1-4), but not as well after virus passage 1 (5% of 
BCoV helper virus levels vs 40—100%: Fig. 5B, lanes 5 and 
6, and Fig. 5C). These results indicate that MHV can serve 
as a helper virus for the replication of BCoV DI RNA and 
thus there are 5’-proximal as well as 3’-proximal cis-acting 
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BCoV GAUUGUGAGCGAUUUGCGUGCGUGCAU CCCGCUUCACD GAUCUCUUGUUAGADCUUUU UaUAAUCUAAACUUDAUAAAAACAUCCACUCCCUGUAU UCUAUGCUUGUGGGC 
0c43 GAUUGUGAGCGAUDUGCGUGCGUGCAU CCCGCcUUCACD GAUCUCUUGUUAGAUCUUUU UgUAAUCUAAACUUUAUAAAAACAUCCACUCCCUGUAa UCUAUGCUUGUGGGC 
HECoV GAUUGUGAGCGAUUUGCGUGCGUGCAU CCCGCUUCACU GAUCUCUUGUUAGADCUUUU UaUVAAUCUAAACUDUAUAAAAACAUCCACUCCCUGUAg UCUAUGCCUGUGGGE 
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MHVA UAUAAGAGUGAUUGGCGUCCGUACGUACCCUCUCaACUCUAAAACUCUUG UAGUDUAAA UCUAAUCUAAACUUUAUAAACGG CACUUCCUGCGUGUCCAUGCCcGcGGGC 
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BCoV GUAGAUUUUUCAUAGUGGUGUCUAUAUUCADUUCUGCUGUUAACAGCUUUCAGCCAGGGACGUGUUGUAUCCUAGGCAGUGGCCCACCCAUAGGUCAC AAUGUCGAAGAUCAACAAAU 
0C43 GUAGAUUUUUCAUAGUGGUGUUVAUAUUCADUUCUGCUGUUAACAGCOUUCAGCCAGGGACGUGUUGUAUCCUAGGCAGUGGCECCgCCCAUAGGUCAC AAUGUCGAAGAUCAACAAAU 
HECoV GUAGAUUUUUCAUAGUGGUGUCUADAUDCAUDUCUGCUGUUAACAGCODUCAGCCAGGGACGUGUUGUAUCCUAGGCAGUGGCCCACCCAUAGGUCAC AAUGUCGAAGAUCAACAAAU 
HEV GUAGAUUUUUCAUAGUGGUGUCUADAUUCAQUUCUGCUGUUAACAGCOUDUCAGCCAGGGACGUGUUGUAUCCUAGGCAGUGGECCACCCAUAGGUCAGC AAUGUCGAAGAUCAACAAAU 
ECoV uVAGACUUUUCAUAGUGGUGUUUgDAUDaAUUUCUaCUGUUAACAaCUcUCAGCCAGGGACGUGUUGUAUCCUAGuUCAGUGGCCCuCCCAUAGGUCAC AAUG GCGAAGAUCAACAAAU 


MHVA CUGGUCUUGUCAUAGUGCUGACAUUUGUaGUUCCUUGA CUUUCGUUCUCUGCCAGUGACGUGUCCAUUCGGCGCCAGCAGCCCACCCAUAGGUUGCAUAAUGGCAARAGAUGGGCAAAU 
MHV2 uUGGUCUUGUCAUAGUGCUGACAUDUGUGGUUCCOUGg uUUUUGUUCUCUGCCAGUGACGUGUCCAUUCGGCGECCAG CAGCCCACCCAUAGGUUGCAUAAUGGCAAAGAUGGGCAAAU 
MHVJ CUGGUCUUGUCAUAGUGCUGACAUDUGUGGUUCCUUGA CUDUCGUUCUCUGCCAGUGACGUGUCCAUUCGGCECCAGCAGCCCACCCAUAGGUUGCAUAAUGGCARAGAUGGGCAAAU 
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BCoV ACGGUCUCGAACUACACUGGGCUCCAGAAUUUCCAUGGAUGU UUGAGGACGCAGAGGAGAA GUUGGAUAACCCUAGUAGUUCAGAGGUGGAUAUAGU aUGCUCCACCACUGCGCAAAAGC 
0C43 ACGGUCUCGAACUACACUGGGCUCCAGAAUUUCCAUGGAUGUUUGAGGACGCAGAGGAGAA GUUGGAUAACCCUAGUAGUUCAGAGGUGGAUAUg aUUUGCUCCACCACUGCGCAAAAGC 
HECoV ACGGUCUCGAACUACACUGGGCUCCAGAAUUUCC AUGGAUGU UUGAGGACGCAGAGGAGAA GUUGGA cBACCCUAGUAGUUCAGAGGUGGAUAUAGU aUGCUCCACCACUGCGCAAAAGC 
HEV ACGGUCUCGAACUACACUGGGCUCCAGAAUUUCCAUGGAUGUUUGAGGACGCAGAGGAGAA GUUGGAUAACCCUAGUAGUUCAGAGGUGGAUAUVAGUUUGCUCCACCACUGCGCAAAAGC 
ECoV ACGGUCUCGAcCUgCAaUGGGCUCCAGAAUUUCC g UGGAUGUUUGAGGAC a CAGAGGAGAAGUUGGAUAACCCUAGUGG gUCAGAGGUGG gUAUAaUUUGCCECACCACUGCGCAAGAGU 
MHVA ACGGUCUCGGCUUCAAAUGGGCCCCAGAAUUUCC AUGGAUGCUUCCGAACGCAUCGGAGAA GUUGGGUAACCCUGAGAGGUCAGAGGAGGAUGGGUDUNGECCCCUCUGCUGCGCAAGAAC 
MHV2 ACGGUCUCGGCUUCAARAUGGGCCCCAGAAUUUCCAUGGAUGCUUCCGAACGCAUCGGAGAA GUUGGGUA gCCCUGAGAG GUCAGAGGAGGAUGGGUUUUGCCCCUCUGCUGCGCAAGAAC 
MHVJ ACGGUCUCGGCUUCABAUGGGCCCCAGAAUUUCC AUGGAUGCUUCCGAACGCAUCGGAGAA GUUGGGUAACCCUGAGAG GUCAGAGGAGGAUGGGUDUUGCCCCUCUGCUGCGCAAGAAC 
SDAV AC 
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BCoV UGGAARACAGGCGGAAUUUGUCCUGAAAAUCAUGUGAUGGUGGAUUGU CGCCGACUUC UUAAACAAGAGUGUUGUGUGCAGUCUA GCCUAAVACGUGAAAUUGUUAUGAAUACACGUCCAU 
0C43 UGGAAACAGACGGAAUUUGUCCUGAAAAUCAUGUGAUGGUGGAUUG cCGCCGACUUCUUAAACAAGAGUGUUGUGUGCAGUCUAGCCUAAUACGUGAAAU UGUUAUGAAUgCAaGUCCAU 
HECoV UGGAAACAGGCGGAAUUUGDCCUGAAAAUCAUGUGAUGGUGGAUUGUCGCEGACUUCUUAAACAAGAGUGUUGUGUGCAGUCUAGCCUAAUACGUGAAAU UGUUAUGAAUVACACGUCCAU 
HEV UGGAAACAGGuGGAA UUUGUCCUGAAAAUCAUGUGAUGGUGGAUUGUCGCCGACUUCUUAAACAAGAGUGUUGUGUGCAGUCUAGCCUAAUACGUGAAAU UGUUAUGAAUACACGUCCAU 
ECoV UGGgAACAuGuaGAAUVac0uuUGAgAAUCAUGUGAUGGUUGAUUGUCGECGc = CUcAuuCAAGAGgGUUGUGUGCAGUCUAaCCUgGAUACGUGAAAD aaagAUGAAUACACGUCCuU 
MHVA CGAAAGUUAAAGGAAAAACOUUUGGUUAAUCACGUGAGGGUGa AUUGUAGCCG GCUUCCAGC LUUGGAaUGCUGUGUUCAGUCUGCCAUAAUCCGUGAUAU UUUUGUaGAUGAGGAUCCCC 
MHV2 CaAARacUAAAGGAAAAACUUUGAUUAAUCACGUGAGGGUGGAUUGUAGCCEGCUUCCAGC aUUGGAGUGCUGUGUUCAGUCUGCCAUAAUCCGUGAUAU UNUUGUUGAUGAGGAUCECu 
MHVJ CGAAAGUUAAAGGAAAAACUUUGGUUAAUCACGUGAGGGUGGAUUGUAGCCG GCUUCCAGC cCUUGGAGUGCUGUGUUCAGUC cGCCAUAAUCCGUGAUAD UUUUGUUGA cGAGGAUCECC 
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HEV MSKINKYGLELHWAPEFPWM FEDAEEKLDNPSSSEVDIVCSTTAQKLETGGIC PENHVMVDCRRLLKOECCVOSSLIREIVMNTRPYDLEVLLODA 
ECoV MAKINKYGLDLOWAPEFPWMFEDTEEKLDNPSGSEVGIICPTTAQELGTCRITFENHVMVDCRR LIQEGCVOSNLIREIKMNTRPYDMDRVIQDA 
MHVA MAKMGKYGLG FKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGKTLVNHVRVNCSRLPALECCVQSAI IRDIFVDEDPQKVEASTMMA 
MHV2 MAKMGKYGLGFKWAPEFPWMLPNASEKLGSPERSEEDGFCPSAAQEPKTKGKTLINHVRVDCSRLPALECCVQSAI IRDIFVDEDPLNVEASTMMA 
MHVJ MAKMGKYGLGFKWAPEFPWMLPNASEKLGNPERSEEDGFCPSAAQEPKVKGKTLVNHVRVDCSRLPALECCVQSAI IRDIFVDEDPQKVEASTMMA 
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Fig. 2. Aligned 5’-terminal sequences of group II coronaviruses. (A) Aligned 5’ UTR and adjacent 289-nt (partial ORF 1a) sequences. The alignment was 
constructed to minimize inserted gaps; only unambiguously aligned nucleotide positions in the UTR (identified by 1s in the MASK) were used to calculate 
the phylogenetic tree (shown in Fig. 4). Identical regions among all nine group 2 coronaviruses are noted by shading. Note that the SDAV sequence is known 
for only 21 nts into ORF la, so sequences of only eight viruses are shaded beyond this point. Positions of the BCoV leader sequence (65 nts) and of BCoV 
stem loops I, II, II and IV, of which stem loops III and IV are phylogenetically conserved among group 2 coronaviruses (Chang et al., 1996; Raman et al., 
2003; Raman et al., unpublished data), are noted. Numbers above refer to positions in the sequence alignment. Lowercase letters identify nonconforming 
nucleotides within the respective BCoV-like or MHV-like subgroups. Part of the pDrep1 5’-terminal sequence is shown to indicate its point of divergence 
from BCoV genome sequence. (B) Aligned amino acid sequences within the partial ORF la. Only unambiguously aligned amino acid positions (identified 
by ls in the MASK) were used to calculate the phylogenetic tree (shown in Fig. 4). Numbers above refer to positions in the sequence alignment. 
Abbreviations: BCoV, bovine coronavirus-Mebus; OC43, human coronavirus-OC43; HECoV-OC43, human enteric coronavirus-4408; HEV, porcine 
hemagglutinating encephalomyelitis virus-TN11; ECoV, equine coronavirus-NC99; MHVA, mouse hepatitis virus-A59; MHV2, mouse hepatitis virus-2; 
MHVJ, mouse hepatitis virus-JHM; SADV, rat sialodacryoadenitis virus. 
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Fig. 3. Aligned 3’ UTR sequences of group II coronaviruses. The alignment was constructed to minimize inserted gaps; only unambiguously aligned 
nucleotide positions (identified by 1s in the MASK) were used to calculate the phylogenetic tree (shown in Fig. 4). Shaded areas identify sequences identical 
among all 10 viruses. Numbers above refer to the sequence alignment. The underlined UAA stop codon of the N ORF begins the 3’ UTR and a poly (A) 
tail, identified as (A)n, terminates the sequence. Sequences making up a cis-acting bulged stem-loop (Hsue and Masters, 1997), a pseudoknot (Williams et 
al., 1999), a long-range stem interaction (identified as a and b) (Liu et al., 2001), and a conserved GGAAGAGC octomer, are bracketed. Lowercase letters 
indicate nonconforming nucleotides within the respective BCoV-like or MHV-like subgroups. Abbreviations are as described in Fig. 2, with the addition of 


PV, puffinosis virus. 


replicating signaling elements in BCoV DI RNA recognized 
by the MHV RNA replication apparatus. RNA replication 
signals are therefore likely to be common between the two 
viruses. The relatively low abundance of BCoV RNA in 
VP1 and VP2 of MHV-infected cells (Fig. 5B, lane 5) 
suggests, furthermore, that MHV does not package the 
BCoV DI RNA as well as do the BCV-like helper viruses. 
A difference in migration rates between marker DI RNA 
(Fig. 5B, lane 1) and intracellular DI RNA (Fig. 5B, lanes 
2-6) has been previously observed (Chang et al., 1994; 
Senanayake and Brian, 1999) and apparently represents an 
in-gel displacement of intracellular DI RNA by abundant 
comigrating cellular RNA species. 


Leader switching from the HCoV-OC43 helper virus to 
BCoV DI RNA 


Leader switching was first described between an MHV- 
JHM-derived DI RNA and MHV-A59 (Makino and Lai, 
1989). Leader switching is thought to result from a high- 


frequency recombination step that is similar, but not neces- 
sarily identical to, the RdRp jump by which leader is ac- 
quired on coronavirus subgenomic mRNAs (Brian and 
Spaan, 1997; Chang et al., 1996; Makino and Lai, 1989; 
Zhang and Lai, 1996). It was therefore anticipated that if 
replication of BCoV DI RNA resulted from the use of a 
heterologous group 2 coronavirus that the progeny DI RNA 
would acquire the leader of the helper virus. This possibility 
was sought on DI RNA progeny in cells infected with all 
helper viruses except HECoV, which has a leader sequence 
indistinguishable from that of BCoV. Leader switching was 
observed for HCoV-OC43 (four of four cDNA clones in 
VPI RNA), ECoV (three of four clones), and HEV (three of 
four clones), but not for MHV (zero of eight cDNA clones 
examined and data not shown). In the sequence data from 
one of the VP1 cDNA clones from HCoV-OC43 helper 
virus (Fig. 5E) it can be seen that the recombination site 
occurred somewhere between HCoV-OC43 marker nt A95 
and BCoV marker nt C133, slightly downstream of a high- 
frequency leader switching window (nts 70—93) previously 
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Fig. 4. Phylogenetic relationships inferred from coronavirus sequences: (A) 
5’ UTR (145 nucleotide positions), (B) 3’ UTR (260 nucleotide positions), 
and (C) partial 5’ ORF (95 amino acid positions). The phylogentic frame- 
works in A and B were inferred from nucleic acid sequences by maximum 
likelihood (-In L = 404.60992 and 539.07673 for A and B, respectively); 
the framework in C was inferred from translated amino acid sequences by 
a distance method. Paired numbers represent bootstrap proportions calcu- 
lated by distance (top numbers) and parsimony (bottom numbers) methods. 
Scale bar units indicate nucleotide (A and B) or amino acid (C) substitutions 
per sequence position. Abbreviations are as described in Figs. 1 and 2. 


identified in BCoV (Chang et al., 1996). These results con- 
firm and extend our understanding of the high-frequency 
nature of leader switching by strains of group 2 coronavi- 
ruses and document for the first time a recombination event 
between human and animal coronavirus molecules. 


Discussion 


The results of this study show first a distinct subdivision 
of group 2 coronaviruses into two subgroups, those that are 
BCoV-like and those that are MHV-like, with regard to 
sequence patterns in the 5’ and 3’ UTRs (Figs. 2 and 3). The 
significance of this divergence, however, is not clear since 
the replication apparatus of one virus will replicate the DI 
RNA of another, at least in the one-way tests employed 
here. Although we cannot judge the relative efficiencies of 
the helper viruses used since replication rates differed in the 
HRT cells, we think it quite likely that UTR sequence 
differences reflect a fine-tuning of structural interactions 
between each virus genome and its cognate replication ap- 
paratus for optimal replication. Since previous studies had 
shown that the 3’ terminus is interchangeable between the 
BCoV and MHV 3’ UTRs (Hsue and Masters, 1997; S. Ku, 
G.D. Williams, and D.A. Brian, unpublished data), the new 
information derived from this study is that there are also 
shared signals in the genome 5’ terminus. It remains to be 
determined what explains the lower rate of accumulation 
and packaging for BCoV DI RNA with MHV-H2 helper 


virus as compared to the BCoV-like helper viruses. It also 
remains to be determined what precise signaling elements 
are shared among the members of the group. 

It has been found in a number of coronavirus DI RNA 
systems that DI RNA upon replication takes on the leader of 
the helper virus. The same was found here for the BCoV DI 
RNA and all helper viruses with a distinguishable leader 
sequence except for MHV. The absence of leader switching 
with MHV may reflect the relatively poor replication of 
MHV-H2 in BCoV DI RNA-transfected cells or it may 
possibly reflect other mechanistically incompatible features. 
The data showing leader switching with HCoV-OC43 are, 
to our knowledge, the first to document an in vivo recom- 
bination event between molecules of a human and animal 
coronavirus. Coronavirus leader switching has been com- 
pared mechanistically to RdRp strand switching that gives 
rise to leader-containing subgenomic transcripts (Chang et 
al., 1996; Makino and Lai, 1989; Zhang and Lai, 1996). A 
demonstration of leader switching therefore raises the ques- 
tion of whether the heterologous helper viruses can support 
synthesis of subgenomic mRNAs from the BCoV DI RNA 
template as has been previously observed in the BCoV 
(Krishnan et al., 1996; Ozdarendeli et al., 2001) and MHV 
(Makino et al., 1991; van der Most et al., 1994) homologous 
systems. Preliminary experiments indicate that they do 
(H.-Y. Wu and D.A. Brian, unpublished data), but that the 
patterns of transcription differ among the helper viruses, sug- 
gesting a different RdRp mechanism and hence probably dif- 
ferent signaling processes are at play between the processes of 
transcription and replication (H.-Y. Wu, A. Ozdarendeli, and 
D. Brian, unpublished data). A mechanistic separation of tran- 
scription and replication has been noted in arteriviruses (van 
Dinten et al., 1997), a family placed along with coronaviruses 
in the order Nidovirales (Cavanagh et al., 1997). 

The results shown here establish that despite sequence 
differences in the 5’ and 3’ UTRs and in the first 289 nts 
within ORF 1 of group 2 coronaviruses, the replication 
apparatuses of these viruses recognize and replicate the 
BCoV DI RNA, indicating the recognition of a common set 
of replication signals within the group. The results suggest, 
furthermore, that as a consequence of common replication 
signals there exists a high potential for recombination 
among members of group 2. To what extent common rep- 
lication signals exist among the coronavirus groups, includ- 
ing the recently described SARS coronavirus (Marra et al., 
2003; Rota et al., 2003), remains to be determined. Based on 
the results presented here, such information would seem 
useful in predicting the recombination potential among 
coronaviruses. 


Materials and methods 


Cells and viruses 


The human adenocarcinoma (human rectal tumor) cell 
line HRT-18 (Tompkins et al., 1974) was used as previously 
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Fig. 5. Replication of BCoV DI RNA by the helper virus functions of group 2 coronaviruses. (A) Northern blot analyses showing replication (accumulation) 
of reporter-containing BCoV DI RNA in the presence of BCoV, HCoV-OC43, ECoV, HECoV, and HEV, and accumulation of mRNA 7 as an indicator of 
helper virus replication. RNA was extracted at the indicated times posttransfection, or at 24 h after virus passaging (VP1). Lanes 1—4, blots were probed with 
a DI RNA reporter-specific radiolabled probe. Lanes 5-8, blots used in lanes 1—4 were probed with a 3’ UTR-specific probe that recognizes an identical 
sequence in all the helper viruses. (B) Northern blot analyses showing replication (accumulation) of reporter-containing BCoV DI RNA in the presence of 
BCoV and MHV-H2 helper viruses, and accumulation of mRNA 7 as an indicator of helper virus replication. RNA was extracted at the indicated times 
postinfection, or at 24 h after virus passaging (VP! and VP2). Lanes 1—8, blots were probed to detect DI RNA only as were lanes 1—4 in (A). Lanes 9-16, 
blots used in lanes 1—8 were stripped and probed to detect viral mRNAs as were lanes 5—8 in (A). Lanes | and 9, pDrep1 marker transcript. RNA from 
mock-infected (uninf) cells (lanes 7 and 15), or infected, mock-transfected (inf) cells (lanes 8 and 16), was harvested at 24 h post-mock-infection and 
postinfection; respectively. (C) Bar graphs illustrating the amounts of BCoV DI RNA accumulated relative to amounts accumulated with BCoV helper virus 
as measured in (A) and (B). (D) Bar graphs illustrating the amounts of helper virus mRNA 7 relative to amounts of BCoV mRNA 7 as measured in (A) and 
(B) (E). Sequence analysis of the leader fusion region of DI RNA progeny from cells infected with HCoV-OC43 helper virus and transfected with 
reporter-containing BCoV DI RNA. The HCoV-OC43-specific bases G60 and A95 and the BCoV-specific base C133 show that the progeny DI RNA 
molecule possesses a leader derived from the HCoV-OC43 helper virus and that the recombination site exists somewhere between nts 95 and 133. 


described for BCoV (Lapps et al., 1987; Chang et al., 1994) 
for the growth of the BCoV-like and MHV-H2 viruses. 
BCoV-Mebus was plaque purified three times (Lapps et al., 
1987) and a DI RNA-free stock (Williams et al., 1999) was 
used. HCoV-OC43 (Hogue and Brian, 1986) obtained from 
S.R. Weiss (Univ. of Pennsylvania, Philadelphia, PA), HE- 
CoV-4408 (Zhang et al., 1994), obtained from X.M. Zhang 
(Univ. of Arkansas, Little Rock, AR), HEV-TN11 obtained 


from John Black (C.E. Kord Animal Disease Laboratory, 
Nashville, TN), and ECoV-NC99 (Guy et al., 2000) were 
each plaque purified three times and used to prepare stocks 
at concentrations of >10’ PFU/ml. MHV-H2, a strain 
adapted for growth on human epithelial cells (Baric et al., 
1997), was obtained from R.S. Baric (Univ. of North Caro- 
lina, Chapel Hill, NC) and used directly to prepare a stock 
at a concentration of >10’ PFU/ml. L2 (Percy) cells were 
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Table 1 
Oligonucleotides used in this study 


Oligonucleotide* Polarity Sequence (5'—3") Nt binding region 
3’UTRPSEU(+) = CTGCAAGTCATCCATTCTGATAGAGAGTG nts 63-91 in BCoV 3’ UTR? 
S5RAAP(—)° (+) GGCCACGCGTCGACTAGTACGGGGGGGGGG 3' poly(C) on cDNA 
Ada3(+) = GGAATTCTCGAGCTCAAGCTTTTTTTTTTTITTTT 3’ poly(A) tail 

DI3'(+) _ CGGGATCCGTCGACACGCGTTTTTITTTTTTTITITTITTT 3’ poly (A) tail 

DI420(—) + GTTGTGTGCAGTCTAGCCTAATAC nts 419-442 in pDrep1 
Apal(+) = GGCTGAAAGCTGTTAACAGCAGAAATG nts 140-166 in pDrep1 
Ta1096(+) ad CGCACAACGTGCCATGCCAC nts 1096-1215 in BCoV genome® 
Leader(—) + GAGCGATTTGCGTGCGTGCATCCCGC nts 7-32 in pDrep1 

Ndel(+) = CCTCCAAATCATATGGACGTGTATTC nts 456-481 in pDrep1 
Pst(—) oF GAGAGTTGACGCAGAGGAC nts 1799-1818 in pDrep1 
TGEV8(+) = CATGGCACCATCCTTGGCAACCCAGA nts 1098-1123 in pDrep1 


“ The positive and negative symbols in the oligonucleotide names indicate the polarity of the nucleic acid to which the oligonucleotide anneals. 
© The sequence in nts 63-91 in BCoV is common to all the group 2 coronaviruses examined in this study (note Fig. 3). 


© Supplied with the RACE kit (Invitrogen). 


“From BCoV-Mebus genome sequence as determined by K. Nixon and D. Brian (GenBank Accession No. U00735). 


used to grow SDAV-681 as previously described (Yoo et 
al., 2000) and mouse L cells were used to grow PV as 
previously described (Klausegger et al., 1999). 


Oligonucleotides 


The oligonucleotides used in this study are described in 
Table 1. 


Genome sequence determination 


To determine the 5’ proximal genome sequences of 
HCoV-OC43, HECoV-4408, HEV-TN11, and ECoV- 
NC99, RNA was extracted with Trizol (Life Technologies) 
from infected cells at 24 hpi and cDNAs were synthesized 
from RNA using RT and oligonucleotide Ial096(+). 
cDNAs were amplified by PCR using oligonucleotides 
Ta1096(+) and leader(—); the products were cloned into the 
TOPO XL vector (Invitrogen), and resulting plasmid DNAs 
were sequenced by automated dideoxynucleotide sequenc- 
ing using oligonucleotides leader(—), DI420(—), and 
Ia1096(+) as sequencing primers. To determine the 5’ ter- 
minal genome sequences, RACE was used as recommended 
by the manufacturer of the RACE kit (Invitrogen). For this, 
first-strand cDNA was made with oligonucleotide NdeI(+), 
and the product was C-tailed and then PCR amplified with 
oligonucleotides Hpal(+) and 5RAAP(-—), the bridge an- 
chor primer supplied with the kit. The products were cloned 
into the TOPO XL vector and the resulting plasmids were 
sequenced by automated sequencing using oligonucleotide 
HPal(+) and as sequencing primer. 

To determine the 3’-terminal genome sequences of the 
BCoV-like viruses, CDNA was made by using oligonucle- 
otide DI3’(+), which binds to the 3’-poly(A) tail and pro- 
vides BamHI, Sall, Accl, Hincl, and MlulI restriction endo- 
nuclease cloning sites, amplified by PCR _ with 
oligonucleotides DI3'(+) and Pst(—), and the PCR products 
were cloned into TOPO XL cloning vector (Invitrogen). For 


PV, purified genomic RNA was reverse transcribed employ- 
ing oligonucleotide Ada3(+). The second strand was syn- 
thesized and cloned as described previously (Klausegger et 
al, 1999). The resulting plasmids were sequenced by auto- 
mated DNA sequencing. 


Phylogenetic analyses 


Nucleotide sequences for 3’ and 5’ UTRs and amino acid 
sequences for the partial 5’ ORF were aligned manually by 
using the Genetic Data Environment (GDE) (Smith et al., 
1994). Nucleotides and amino acid residues were aligned in 
a manner that minimized the number of gaps, and nucleotide 
positions between the first identical nucleotide in all posi- 
tions and the gaps were omitted from phylogenetic calcu- 
lations to minimize ambiguity. Sufficiency of the phyloge- 
netic “signal” in each aligned dataset was assessed by the 
random tree-length distribution method implemented in 
PAUP* (Version 4.0b10; Hillis and Hulsenbeck, 1992; 
Swofford, 1996). The sequence alignments contained suffi- 
cient information for valid phylogenetic analyses (g1 skew 
statistics —0.61, —0.55, and —1.033 for 5’ UTR, 3’ UTR, 
and 5’ partial ORF, respectively). Phylogenetic frameworks 
were inferred from nucleotide sequences by maximum like- 
lihood in PAUP* with settings that correspond to the 
HKY85 + G model (two-substitution types, four rate cate- 
gories following a y-distribution with no invariant sites and 
nucleotide frequencies, transition/transversion ratio, and the 
y-shape parameter estimated by maximum likelihood). 
Bootstrap proportions were calculated from 100 resampled 
datasets by using neighbor-joining (Kimura two-parameter 
model, transition/transversion ratio of 2) and maximum 
parsimony, also in PAUP*. Phylogenetic relationships 
among amino acid sequence translations for the partial 5’ 
ORF were inferred by neighbor-joining from distances cal- 
culated by using the PAM matrix of amino acid substitution 
frequencies, as implemented in PHYLIP (Felsenstein, 
1993). Bootstrap proportions were inferred from 100 re- 
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sampled datasets by using PHYLIP’s PAM distance/neigh- 
bor joining and protein parsimony methods. 


Plasmid constructs 


Construction of pGEM3Zf(—) (Promega)-based pDrep | 
(Fig. 1) has been described (Chang et al., 1994). 


Northern assay for DI RNA and virus replication 


The Northern assay for detecting reporter-containing DI 
RNA was performed essentially as previously described 
(Chang et al., 1994; Chang and Brian, 1996). Briefly, cells 
at 80% confluency (~1 X 10° cells/dish) in 35-mm-diam- 
eter dishes were infected with the indicated helper virus at 
a multiplicity of approximately 10 PFU per cell and trans- 
fected with 600 ng of pDrep!1 transcript in Lipofectin (In- 
vitrogen) at 1 hpi. For passage of progeny virus, supernatant 
fluids were harvested at 48 hpi and 500 wl was used to 
directly infect freshly confluent cells in a 35-mm dish. RNA 
extracted with Trizol (yielding approximately 10 wg per 
plate) was stored at —20°C as an ethanol precipitate. For 
Northern analysis, precisely one-fourth of the total RNA 
preparation (approximately 2.5 jg per lane) was used for 
electrophoresis in a formaldehyde-agarose gel. Approxi- 
mately | ng of transcript was loaded per lane when used as 
a size marker. RNA was transferred from the gel to Nytran 
membrane by vacuum blotting and blots were probed with 
oligonucleotide TGEV8 5’-end labeled with *P to specific 
activities of 1.5 X 10° to 3.5 X 10° cpm/pmol (Cerenkov 
counts). For quantitating viral RNA replication, the blots 
used earlier for DI RNA detection on which radioactivity 
was no longer detectible were stripped and probed with 
*?P-radiolabeled oligonucleotide 3'UTRPSEV(+), a probe 
complementary to a common 3’ UTR sequence in all helper 
viruses (Table 1). Probed blots were read with a Packard 
InstantImager Autoradiography System for quantitation and 
exposed to Kodak XAR-5 film for 6 to 48 h at —80°C for 
imaging. Images were recorded by digital scanning of films. 


Sequence of progeny DI RNAs 


To determine the 5’-terminal leader sequence of progeny 
DI RNA, TGEV8(+) and RNA from cells infected with 
VPI virus were used for RACE analysis as described above. 


Accession numbers 


GenBank Accession Nos. for the sequences studied here 
are as follows: U00735 for BCoV-Mebus, AF523843 and 
AF523847 for HCoV-OC43, AF523844 and AF523848 for 
HECoV-4408, AF523845 and AF523849 for HEV-TN11, 
AF523846 and AF523850 for ECoV-NC99, NC001846 for 
MHV-A59, AF201929 for MHV-2, M55148 and X00990 
for MHV-JHM, AJ544718 for PV, and AF124990 and 
AF207551 for SDAV. 
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